Differentially private false discovery rate control
نویسندگان
چکیده
Differential privacy provides a rigorous framework for privacy-preserving data analysis. This paper proposes the first differentially private procedure controlling false discovery rate (FDR) in multiple hypothesis testing. Inspired by Benjamini-Hochberg (BHq), our approach is to repeatedly add noise logarithms of p-values ensure differential and select an approximately smallest p-value serving as promising candidate at each iteration; selected are further supplied BHq releases only rejected ones. Moreover, we develop new technique that based on backward submartingale proving FDR control broad class testing procedures, including procedure, both step- up step-down procedures. As novel aspect, proof works arbitrary dependence between true null test statistics, while maintained small multiplicative factor.
منابع مشابه
Private False Discovery Rate Control
We provide the first differentially private algorithms for controlling the false discovery rate (FDR) in multiple hypothesis testing, with essentially no loss in power under certain conditions. Our general approach is to adapt a well-known variant of the Benjamini-Hochberg procedure (BHq), making each step differentially private. This destroys the classical proof of FDR control. To prove FDR co...
متن کاملIdentifying differentially expressed genes using false discovery rate controlling procedures
MOTIVATION DNA microarrays have recently been used for the purpose of monitoring expression levels of thousands of genes simultaneously and identifying those genes that are differentially expressed. The probability that a false identification (type I error) is committed can increase sharply when the number of tested genes gets large. Correlation between the test statistics attributed to gene co...
متن کاملDiscrete False-Discovery Rate Improves Identification of Differentially Abundant Microbes
Differential abundance testing is a critical task in microbiome studies that is complicated by the sparsity of data matrices. Here we adapt for microbiome studies a solution from the field of gene expression analysis to produce a new method, discrete false-discovery rate (DS-FDR), that greatly improves the power to detect differential taxa by exploiting the discreteness of the data. Additionall...
متن کاملOn Online Control of False Discovery Rate
Multiple hypotheses testing is a core problem in statistical inference and arises in almost every scientific field. Given a sequence of null hypotheses H(n) = (H1, . . . ,Hn), Benjamini and Hochberg [BH95] introduced the false discovery rate (FDR), which is the expected proportion of false positives among rejected null hypotheses, and proposed a testing procedure that controls FDR below a pre-a...
متن کاملFalse Discovery Rate Control With Groups.
In the context of large-scale multiple hypothesis testing, the hypotheses often possess certain group structures based on additional information such as Gene Ontology in gene expression data and phenotypes in genome-wide association studies. It is hence desirable to incorporate such information when dealing with multiplicity problems to increase statistical power. In this article, we demonstrat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The journal of privacy and confidentiality
سال: 2021
ISSN: ['2575-8527']
DOI: https://doi.org/10.29012/jpc.755